DOCUMENT RESUME 



ED 043 222 



EH 008 361 



AUTHOR        Bond, Nicholas A., Jr.; Rigney, Joseph W.
TITLE         Measurement of Training Outcomes.
INSTITUTION   University of Southern California, Los Angeles. Dept. of Psychology.
SPONS AGENCY  Office of Naval Research, Washington, D.C. Personnel and Training Branch.
REPORT NO     TR-66
PUB DATE      Jun 70
NOTE          U 9p.



EDRS PRICE    EDRS Price MF-$0.25 HC-$2.55

DESCRIPTORS   Achievement, *Education, Evaluation, *Measurement,
              *Research Methodology, Social Change, Training,
              Training Objectives, Training Techniques



ABSTRACT 

Measurement of training outcomes is a requirement
for evaluating new training techniques, but is one that is difficult
to meet. Managers of education and training may have different
concepts of what they want, as favorable outcomes, than do the
investigators doing the research. Classical statistical and
experimental designs assume laboratory rigor of control over
variables that is seldom possible in the real world of a school or
classroom. Yet in the broader perspective of educational
institutions, the effectiveness of the institutions is a current
issue of fundamental concern in our society. In this report,
possibilities for measuring outcomes of training are surveyed,
considering training as a form of planned social change. Various
approaches are discussed. Illustrations from the computer-assisted
instruction (CAI) literature of recent attempts to measure training
outcomes are given. The principal conclusions presented are that the
classical four-way design is impracticable for most evaluation
studies in training environments; that a policy of "adaptive research
for big effects" is apt to be scientifically and administratively
desirable; and that current attempts at measurement of training
outcomes still use fairly simple methods. (Author)







BEHAVIORAL 

TECHNOLOGY LABORATORIES 






Technical Report No. 66 



MEASUREMENT OF TRAINING OUTCOMES 



June 1970



Department of Psychology
University of Southern California 



This document has been approved for public release and sale;
its distribution is unlimited. Reproduction in whole or in part
is permitted for any purpose of the United States Government.







U.S. DEPARTMENT OF HEALTH, EDUCATION & WELFARE
OFFICE OF EDUCATION

THIS DOCUMENT HAS BEEN REPRODUCED EXACTLY AS RECEIVED FROM THE
PERSON OR ORGANIZATION ORIGINATING IT. POINTS OF VIEW OR OPINIONS
STATED DO NOT NECESSARILY REPRESENT OFFICIAL OFFICE OF EDUCATION
POSITION OR POLICY.



DEPARTMENT OF PSYCHOLOGY
UNIVERSITY OF SOUTHERN CALIFORNIA 



Technical Report No. 66 
MEASUREMENT OF TRAINING OUTCOMES 



June 1970 



Nicholas A. Bond, Jr. 
Joseph W. Rigney 



Prepared for

Personnel and Training Research Programs
Psychological Sciences Division
Office of Naval Research

Contract N00014-67-A-0269-0012
Contract Authority Identification No. NR 154-295

Reproduction in whole or in part is permitted
for any purpose of the United States Government




THIS DOCUMENT HAS BEEN APPROVED FOR PUBLIC
RELEASE AND SALE; ITS DISTRIBUTION IS UNLIMITED






Unclassified
Security Classification

DOCUMENT CONTROL DATA - R & D

(Security classification of title, body of abstract and indexing annotation must be entered when the overall report is classified)

1. ORIGINATING ACTIVITY (Corporate author)
   Behavioral Technology Laboratories
   University of Southern California
   Los Angeles, California 90007

2a. REPORT SECURITY CLASSIFICATION
   Unclassified

2b. GROUP

3. REPORT TITLE
   MEASUREMENT OF TRAINING OUTCOMES

4. DESCRIPTIVE NOTES (Type of report and inclusive dates)
   Technical Report 66, June 1970

5. AUTHOR(S) (First name, middle initial, last name)
   Nicholas A. Bond, Jr.
   Joseph W. Rigney

6. REPORT DATE
   June 1970

7a. TOTAL NO. OF PAGES
   34

7b. NO. OF REFS
   25

8a. CONTRACT OR GRANT NO.
   N00014-67-A-0269-0012

8b. PROJECT NO.
   NR 154-295

9a. ORIGINATOR'S REPORT NUMBER(S)
   Technical Report 66

9b. OTHER REPORT NO(S) (Any other numbers that may be assigned this report)

10. DISTRIBUTION STATEMENT
   THIS DOCUMENT HAS BEEN APPROVED FOR PUBLIC RELEASE AND SALE;
   ITS DISTRIBUTION IS UNLIMITED

11. SUPPLEMENTARY NOTES

12. SPONSORING MILITARY ACTIVITY
   Personnel and Training Research Programs
   Psychological Sciences Division
   Office of Naval Research

13. ABSTRACT



Measurement of training outcomes is a requirement for evaluating
new training techniques, but is one that is difficult to meet. Managers
of education and training may have different concepts of what they want,
as favorable outcomes, than do the investigators doing the research.
Classical statistical and experimental designs assume laboratory rigor
of control over variables that is seldom possible in the real world of
a school or classroom. Yet in the broader perspective of educational
institutions, the effectiveness of these institutions is a current
issue of fundamental concern in our society. In this report,
possibilities for measuring outcomes of training are surveyed, considering
training as a form of planned social change. Approaches which are
discussed include the classic Solomon four-group design, iterative
adaptation to the peculiarities of individual student progress, response
surface designs, adaptive control models, decision theory models, and
simulation models. Illustrations from the CAI literature of recent
attempts to measure training outcomes are given. The principal
conclusions presented are that the classical four-way design is
impracticable for most evaluation studies in training environments; that a
policy of "adaptive research for big effects" is apt to be scientifically
and administratively desirable; and that current attempts at
measurement of training outcomes still use fairly simple methods.






14. KEY WORDS


Training Outcomes 
Measurement of Change 
Experimental Designs 
Adaptive Models 
Computer-aided Instruction
Instructional Technology 


















ACKNOWLEDGMENTS 



This report is a product of a continuing research program sponsored
by the Personnel and Training Research Programs Branch of the
Psychological Sciences Division, Office of Naval Research. The support,
encouragement, and patience of Dr. Victor Fields and Dr. Glenn L. Bryan
are gratefully acknowledged.

Portions of this report were given as a lecture at an Advanced
Study Institute, sponsored by NATO, at the Royal Naval College,
Greenwich, England, in April 1970.



ABSTRACT 



Measurement of training outcomes is a requirement for evaluating
new training techniques, but is one that is difficult to meet. Managers
of education and training may have different concepts of what they want,
as favorable outcomes, than do the investigators doing the research.
Classical statistical and experimental designs assume laboratory rigor
of control over variables that is seldom possible in the real world of
a school or classroom. Yet in the broader perspective of educational
institutions, the effectiveness of these institutions is a current
issue of fundamental concern in our society. In this report,
possibilities for measuring outcomes of training are surveyed, considering
training as a form of planned social change. Approaches which are
discussed include the classic Solomon four-group design, iterative
adaptation to the peculiarities of individual student progress, response
surface designs, adaptive control models, decision theory models, and
simulation models. Illustrations from the CAI literature of recent
attempts to measure training outcomes are given. The principal
conclusions presented are that the classical four-way design is
impracticable for most evaluation studies in training environments; that a
policy of "adaptive research for big effects" is apt to be scientifically
and administratively desirable; and that current attempts at
measurement of training outcomes still use fairly simple methods.







TABLE OF CONTENTS 



Section                                                      Page

I. INTRODUCTION                                                 1

II. SPECIFIC PERFORMANCE MEASURES                               3
      Gain Scores                                               4
      Number Solved vs. Process Scores                          7
      Time to Criterion                                         7
      Error Rate                                                8
      Persistence Measures                                      8
      Transfer Measures                                         9
      Time vs. Achievement                                     12
      Retention Measures                                       12
      Remarks                                                  13

III. COMPARATIVE DESIGNS FOR EVALUATING TRAINING               14
      Response Surface Designs                                 20
      Adaptive Control Models                                  21
      Decision Theory Models                                   22
      Simulation Models                                        23
      Remarks                                                  24

IV. ILLUSTRATIONS FROM THE CAI LITERATURE                      28
      Remarks                                                  31

REFERENCES                                                     33






LIST OF TABLES 



Table                                                        Page

1. Main Types of Criteria for Assessing the Suitability
   of a Program                                                 2

2. Average Grade-placement Scores on the Stanford
   Achievement Test: California, 1966-67                       28

3. Average Grade-placement Scores on the Stanford
   Achievement Test: California, 1967-68                       29

4. Learning and Test Scores for Three Experimental Treatment
   Groups                                                      31



LIST OF FIGURES 



Figure                                                       Page

1. Classical four-group design for evaluating training
   effects                                                     15

2. Action alternatives, likelihoods, and payoffs in simple
   decision model                                              22

3. Portion of a model simulating a department store buyer's
   stock ordering behavior                                     25

4. Kelley and Prosin's adaptive measurement model              26

5. Student performance for the portion of the fall quarter
   final examination in first-year Russian that was common
   to the computer-based and regular sections                  30






MEASUREMENT OF TRAINING OUTCOMES 
SECTION I. INTRODUCTION 

When somebody asks "How effective is training program X?", he can be
asking about several aspects of X. He may want to know whether the X
material covers the subject matter domain which is to be taught, whether
it does actually teach whatever it is supposed to teach, whether it is
a practical program, and so forth. Thus "effectiveness" is apt to be a
multidimensional concept with many ramifications. Consider one of
Lumsdaine's tables (Table 1), which shows the kinds of internal and
external criteria that could be used in evaluating teaching programs
(Lumsdaine, 1965).

In this report, we are mainly concerned with those external criteria
that Lumsdaine subsumes under his "effectiveness" category; that is,
with those items of information which show how well the teaching
objectives are realized in the students receiving the treatment. At a few
places we do touch upon "appropriateness" and "practicality" matters.
There are three areas for us to consider in this introductory report:
(1) the factors involved in deciding upon specific performance criteria,
(2) the selection of some comparison design for showing effectiveness,
and (3) some examples from the training evaluation literature. We cannot
provide here a cookbook to solve local decisions of scoring and design;
what we can do is to raise some of the issues that might give a basis
for such decisions. It turns out that, although training effectiveness
studies have been rather conventional so far, evaluation schemes derived
from adaptive control and from decision theory show some promise for the
CAI training manager.







Table 1. Main Types of Criteria for Assessing the Suitability of a Program

[Table printed sideways in the original; only the Practicality row is legible.]

Practicality:   Ease of using                        Cost factors: program price,
                Reusability                          adaptability, characteristics
                Machine (instrumentation)            of presentation, machine (if
                requirements                         required)



